Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 400 |
| Missing cells | 3 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 87.5 KiB |
| Average record size in memory | 224.0 B |
Variable types
| Numeric | 23 |
|---|---|
| Categorical | 4 |
Word has a high cardinality: 400 distinct values | High cardinality |
Sentence has a high cardinality: 400 distinct values | High cardinality |
Concreteness is highly correlated with AoA | High correlation |
A priori Predictability is highly correlated with similarity and 3 other fields | High correlation |
OLD20 is highly correlated with #letters and 1 other fields | High correlation |
#letters is highly correlated with OLD20 and 1 other fields | High correlation |
OrthNeighSize is highly correlated with OLD20 and 1 other fields | High correlation |
BigramFreq is highly correlated with TrigramFreq | High correlation |
TrigramFreq is highly correlated with BigramFreq | High correlation |
Frequency is highly correlated with LogFreq(Zipf) | High correlation |
LogFreq(Zipf) is highly correlated with Frequency | High correlation |
similarity is highly correlated with A priori Predictability and 1 other fields | High correlation |
AoA is highly correlated with Concreteness | High correlation |
cloze is highly correlated with A priori Predictability and 2 other fields | High correlation |
Plausibility is highly correlated with A priori Predictability and 2 other fields | High correlation |
Predictability is highly correlated with A priori Predictability and 3 other fields | High correlation |
PRECEDING_Frequency is highly correlated with PRECEDING_LogFreq(Zipf) and 1 other fields | High correlation |
PRECEDING_LogFreq(Zipf) is highly correlated with PRECEDING_Frequency and 1 other fields | High correlation |
LENprec is highly correlated with PRECEDING_Frequency and 1 other fields | High correlation |
Concreteness is highly correlated with AoA | High correlation |
A priori Predictability is highly correlated with cloze and 2 other fields | High correlation |
BLP_rt is highly correlated with BLP_accuracy and 1 other fields | High correlation |
BLP_accuracy is highly correlated with BLP_rt | High correlation |
OLD20 is highly correlated with #letters and 1 other fields | High correlation |
#letters is highly correlated with OLD20 and 1 other fields | High correlation |
OrthNeighSize is highly correlated with OLD20 and 1 other fields | High correlation |
BigramFreq is highly correlated with TrigramFreq | High correlation |
TrigramFreq is highly correlated with BigramFreq | High correlation |
Frequency is highly correlated with LogFreq(Zipf) | High correlation |
LogFreq(Zipf) is highly correlated with BLP_rt and 1 other fields | High correlation |
similarity is highly correlated with Predictability | High correlation |
AoA is highly correlated with Concreteness | High correlation |
cloze is highly correlated with A priori Predictability and 2 other fields | High correlation |
Plausibility is highly correlated with A priori Predictability and 2 other fields | High correlation |
Predictability is highly correlated with A priori Predictability and 3 other fields | High correlation |
PRECEDING_Frequency is highly correlated with PRECEDING_LogFreq(Zipf) and 1 other fields | High correlation |
PRECEDING_LogFreq(Zipf) is highly correlated with PRECEDING_Frequency and 1 other fields | High correlation |
LENprec is highly correlated with PRECEDING_Frequency and 1 other fields | High correlation |
A priori Predictability is highly correlated with cloze and 2 other fields | High correlation |
OLD20 is highly correlated with #letters and 1 other fields | High correlation |
#letters is highly correlated with OLD20 and 1 other fields | High correlation |
OrthNeighSize is highly correlated with OLD20 and 1 other fields | High correlation |
Frequency is highly correlated with LogFreq(Zipf) | High correlation |
LogFreq(Zipf) is highly correlated with Frequency | High correlation |
cloze is highly correlated with A priori Predictability and 2 other fields | High correlation |
Plausibility is highly correlated with A priori Predictability and 1 other fields | High correlation |
Predictability is highly correlated with A priori Predictability and 1 other fields | High correlation |
PRECEDING_Frequency is highly correlated with PRECEDING_LogFreq(Zipf) and 1 other fields | High correlation |
PRECEDING_LogFreq(Zipf) is highly correlated with PRECEDING_Frequency and 1 other fields | High correlation |
LENprec is highly correlated with PRECEDING_Frequency and 1 other fields | High correlation |
Concreteness is highly correlated with AoA | High correlation |
SensorimotorStrength is highly correlated with AoA | High correlation |
A priori Predictability is highly correlated with cloze and 2 other fields | High correlation |
BLP_rt is highly correlated with BLP_accuracy and 2 other fields | High correlation |
BLP_accuracy is highly correlated with BLP_rt and 1 other fields | High correlation |
OLD20 is highly correlated with #letters and 1 other fields | High correlation |
#letters is highly correlated with OLD20 and 1 other fields | High correlation |
OrthNeighSize is highly correlated with OLD20 and 1 other fields | High correlation |
BigramFreq is highly correlated with TrigramFreq | High correlation |
TrigramFreq is highly correlated with BigramFreq | High correlation |
Frequency is highly correlated with LogFreq(Zipf) | High correlation |
LogFreq(Zipf) is highly correlated with BLP_rt and 2 other fields | High correlation |
AoA is highly correlated with Concreteness and 2 other fields | High correlation |
cloze is highly correlated with A priori Predictability and 2 other fields | High correlation |
Plausibility is highly correlated with A priori Predictability and 2 other fields | High correlation |
Predictability is highly correlated with A priori Predictability and 2 other fields | High correlation |
PRECEDING_Frequency is highly correlated with PRECEDING_LogFreq(Zipf) and 1 other fields | High correlation |
PRECEDING_LogFreq(Zipf) is highly correlated with PRECEDING_Frequency and 1 other fields | High correlation |
LENprec is highly correlated with PRECEDING_Frequency and 1 other fields | High correlation |
ID is uniformly distributed | Uniform |
Word is uniformly distributed | Uniform |
Sentence is uniformly distributed | Uniform |
A priori Predictability is uniformly distributed | Uniform |
ID has unique values | Unique |
Word has unique values | Unique |
SensorimotorStrength has unique values | Unique |
Sentence has unique values | Unique |
BigramFreq has unique values | Unique |
TrigramFreq has unique values | Unique |
similarity has unique values | Unique |
Predictability has unique values | Unique |
OrthNeighSize has 121 (30.2%) zeros | Zeros |
cloze has 125 (31.2%) zeros | Zeros |
Reproduction
| Analysis started | 2022-07-01 13:22:56.247914 |
|---|---|
| Analysis finished | 2022-07-01 13:23:58.318952 |
| Duration | 1 minute and 2.07 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 400 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 299.5 |
| Minimum | 100 |
|---|---|
| Maximum | 499 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 100 |
|---|---|
| 5-th percentile | 119.95 |
| Q1 | 199.75 |
| median | 299.5 |
| Q3 | 399.25 |
| 95-th percentile | 479.05 |
| Maximum | 499 |
| Range | 399 |
| Interquartile range (IQR) | 199.5 |
Descriptive statistics
| Standard deviation | 115.6143013 |
|---|---|
| Coefficient of variation (CV) | 0.3860243783 |
| Kurtosis | -1.2 |
| Mean | 299.5 |
| Median Absolute Deviation (MAD) | 100 |
| Skewness | 0 |
| Sum | 119800 |
| Variance | 13366.66667 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 499 | 1 | 0.2% |
| 236 | 1 | 0.2% |
| 226 | 1 | 0.2% |
| 227 | 1 | 0.2% |
| 228 | 1 | 0.2% |
| 229 | 1 | 0.2% |
| 230 | 1 | 0.2% |
| 231 | 1 | 0.2% |
| 232 | 1 | 0.2% |
| 233 | 1 | 0.2% |
| Other values (390) | 390 |
| Value | Count | Frequency (%) |
| 100 | 1 | |
| 101 | 1 | |
| 102 | 1 | |
| 103 | 1 | |
| 104 | 1 | |
| 105 | 1 | |
| 106 | 1 | |
| 107 | 1 | |
| 108 | 1 | |
| 109 | 1 |
| Value | Count | Frequency (%) |
| 499 | 1 | |
| 498 | 1 | |
| 497 | 1 | |
| 496 | 1 | |
| 495 | 1 | |
| 494 | 1 | |
| 493 | 1 | |
| 492 | 1 | |
| 491 | 1 | |
| 490 | 1 |
| Distinct | 400 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 KiB |
| council | 1 |
|---|---|
| prank | 1 |
| cable | 1 |
| launch | 1 |
| sample | 1 |
| Other values (395) |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 5.5075 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2203 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 400 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | absence |
|---|---|
| 2nd row | accent |
| 3rd row | access |
| 4th row | action |
| 5th row | adult |
Common Values
| Value | Count | Frequency (%) |
| council | 1 | 0.2% |
| prank | 1 | 0.2% |
| cable | 1 | 0.2% |
| launch | 1 | 0.2% |
| sample | 1 | 0.2% |
| lift | 1 | 0.2% |
| sonnet | 1 | 0.2% |
| cellar | 1 | 0.2% |
| motive | 1 | 0.2% |
| method | 1 | 0.2% |
| Other values (390) | 390 |
Length
| Value | Count | Frequency (%) |
| cast | 1 | 0.2% |
| blast | 1 | 0.2% |
| autumn | 1 | 0.2% |
| review | 1 | 0.2% |
| ideal | 1 | 0.2% |
| scandal | 1 | 0.2% |
| verdict | 1 | 0.2% |
| cactus | 1 | 0.2% |
| mash | 1 | 0.2% |
| noise | 1 | 0.2% |
| Other values (390) | 390 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 295 | |
| a | 182 | 8.3% |
| r | 174 | 7.9% |
| t | 152 | 6.9% |
| o | 144 | 6.5% |
| c | 129 | 5.9% |
| s | 129 | 5.9% |
| l | 127 | 5.8% |
| n | 123 | 5.6% |
| i | 113 | 5.1% |
| Other values (16) | 635 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2203 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 295 | |
| a | 182 | 8.3% |
| r | 174 | 7.9% |
| t | 152 | 6.9% |
| o | 144 | 6.5% |
| c | 129 | 5.9% |
| s | 129 | 5.9% |
| l | 127 | 5.8% |
| n | 123 | 5.6% |
| i | 113 | 5.1% |
| Other values (16) | 635 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2203 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 295 | |
| a | 182 | 8.3% |
| r | 174 | 7.9% |
| t | 152 | 6.9% |
| o | 144 | 6.5% |
| c | 129 | 5.9% |
| s | 129 | 5.9% |
| l | 127 | 5.8% |
| n | 123 | 5.6% |
| i | 113 | 5.1% |
| Other values (16) | 635 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2203 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 295 | |
| a | 182 | 8.3% |
| r | 174 | 7.9% |
| t | 152 | 6.9% |
| o | 144 | 6.5% |
| c | 129 | 5.9% |
| s | 129 | 5.9% |
| l | 127 | 5.8% |
| n | 123 | 5.6% |
| i | 113 | 5.1% |
| Other values (16) | 635 |
| Distinct | 202 |
|---|---|
| Distinct (%) | 50.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2471 |
| Minimum | 1.19 |
|---|---|
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 1.19 |
|---|---|
| 5-th percentile | 1.5985 |
| Q1 | 2.26 |
| median | 3.075 |
| Q3 | 4.4 |
| 95-th percentile | 4.96 |
| Maximum | 5 |
| Range | 3.81 |
| Interquartile range (IQR) | 2.14 |
Descriptive statistics
| Standard deviation | 1.154878761 |
|---|---|
| Coefficient of variation (CV) | 0.3556646734 |
| Kurtosis | -1.315021753 |
| Mean | 3.2471 |
| Median Absolute Deviation (MAD) | 0.905 |
| Skewness | 0.1174310161 |
| Sum | 1298.84 |
| Variance | 1.333744952 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 11 | 2.8% |
| 4.9 | 11 | 2.8% |
| 4.86 | 7 | 1.8% |
| 4.96 | 7 | 1.8% |
| 4.93 | 7 | 1.8% |
| 2.89 | 7 | 1.8% |
| 1.96 | 5 | 1.2% |
| 3.9 | 5 | 1.2% |
| 3.86 | 5 | 1.2% |
| 1.97 | 4 | 1.0% |
| Other values (192) | 331 |
| Value | Count | Frequency (%) |
| 1.19 | 1 | 0.2% |
| 1.33 | 1 | 0.2% |
| 1.34 | 1 | 0.2% |
| 1.37 | 1 | 0.2% |
| 1.41 | 1 | 0.2% |
| 1.44 | 1 | 0.2% |
| 1.45 | 1 | 0.2% |
| 1.47 | 1 | 0.2% |
| 1.5 | 2 | |
| 1.52 | 4 |
| Value | Count | Frequency (%) |
| 5 | 11 | |
| 4.97 | 4 | 1.0% |
| 4.96 | 7 | |
| 4.93 | 7 | |
| 4.92 | 2 | 0.5% |
| 4.91 | 1 | 0.2% |
| 4.9 | 11 | |
| 4.89 | 2 | 0.5% |
| 4.87 | 3 | 0.8% |
| 4.86 | 7 |
Valence
Real number (ℝ≥0)
| Distinct | 237 |
|---|---|
| Distinct (%) | 59.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.2222 |
| Minimum | 1.68 |
|---|---|
| Maximum | 8.05 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 1.68 |
|---|---|
| 5-th percentile | 2.6295 |
| Q1 | 4.5 |
| median | 5.475 |
| Q3 | 6.15 |
| 95-th percentile | 7.142 |
| Maximum | 8.05 |
| Range | 6.37 |
| Interquartile range (IQR) | 1.65 |
Descriptive statistics
| Standard deviation | 1.34833385 |
|---|---|
| Coefficient of variation (CV) | 0.2581926869 |
| Kurtosis | -0.1422011341 |
| Mean | 5.2222 |
| Median Absolute Deviation (MAD) | 0.77 |
| Skewness | -0.6154958252 |
| Sum | 2088.88 |
| Variance | 1.81800417 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.5 | 7 | 1.8% |
| 6 | 6 | 1.5% |
| 5.9 | 6 | 1.5% |
| 5.95 | 6 | 1.5% |
| 4.89 | 5 | 1.2% |
| 5.29 | 5 | 1.2% |
| 6.45 | 5 | 1.2% |
| 5.86 | 5 | 1.2% |
| 5.68 | 4 | 1.0% |
| 5.16 | 4 | 1.0% |
| Other values (227) | 347 |
| Value | Count | Frequency (%) |
| 1.68 | 1 | |
| 1.79 | 1 | |
| 1.89 | 1 | |
| 1.91 | 1 | |
| 1.95 | 2 | |
| 2 | 1 | |
| 2.05 | 2 | |
| 2.11 | 1 | |
| 2.15 | 1 | |
| 2.29 | 1 |
| Value | Count | Frequency (%) |
| 8.05 | 1 | |
| 7.94 | 1 | |
| 7.81 | 1 | |
| 7.75 | 1 | |
| 7.72 | 1 | |
| 7.67 | 1 | |
| 7.63 | 1 | |
| 7.61 | 1 | |
| 7.53 | 1 | |
| 7.52 | 1 |
Arousal
Real number (ℝ≥0)
| Distinct | 219 |
|---|---|
| Distinct (%) | 54.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.160275 |
| Minimum | 2.15 |
|---|---|
| Maximum | 7.05 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 2.15 |
|---|---|
| 5-th percentile | 2.699 |
| Q1 | 3.41 |
| median | 4 |
| Q3 | 4.86 |
| 95-th percentile | 6.061 |
| Maximum | 7.05 |
| Range | 4.9 |
| Interquartile range (IQR) | 1.45 |
Descriptive statistics
| Standard deviation | 0.9998018222 |
|---|---|
| Coefficient of variation (CV) | 0.2403210899 |
| Kurtosis | -0.3079859129 |
| Mean | 4.160275 |
| Median Absolute Deviation (MAD) | 0.67 |
| Skewness | 0.4901026229 |
| Sum | 1664.11 |
| Variance | 0.9996036836 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 10 | 2.5% |
| 3.45 | 6 | 1.5% |
| 4.5 | 6 | 1.5% |
| 4.05 | 6 | 1.5% |
| 3.67 | 5 | 1.2% |
| 3.2 | 5 | 1.2% |
| 3.1 | 5 | 1.2% |
| 3.05 | 5 | 1.2% |
| 3.95 | 5 | 1.2% |
| 3.9 | 5 | 1.2% |
| Other values (209) | 342 |
| Value | Count | Frequency (%) |
| 2.15 | 1 | 0.2% |
| 2.19 | 1 | 0.2% |
| 2.21 | 1 | 0.2% |
| 2.24 | 1 | 0.2% |
| 2.33 | 1 | 0.2% |
| 2.35 | 2 | |
| 2.45 | 3 | |
| 2.48 | 1 | 0.2% |
| 2.5 | 1 | 0.2% |
| 2.53 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 7.05 | 1 | |
| 6.9 | 1 | |
| 6.85 | 1 | |
| 6.57 | 1 | |
| 6.55 | 1 | |
| 6.52 | 1 | |
| 6.43 | 1 | |
| 6.35 | 2 | |
| 6.31 | 1 | |
| 6.29 | 1 |
| Distinct | 400 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.629960794 |
| Minimum | 1.919279474 |
|---|---|
| Maximum | 7.223465009 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 1.919279474 |
|---|---|
| 5-th percentile | 3.03245118 |
| Q1 | 4.067771543 |
| median | 4.697519846 |
| Q3 | 5.297323439 |
| 95-th percentile | 6.147203697 |
| Maximum | 7.223465009 |
| Range | 5.304185535 |
| Interquartile range (IQR) | 1.229551896 |
Descriptive statistics
| Standard deviation | 0.9312108076 |
|---|---|
| Coefficient of variation (CV) | 0.2011271475 |
| Kurtosis | 0.1137164603 |
| Mean | 4.629960794 |
| Median Absolute Deviation (MAD) | 0.602883567 |
| Skewness | -0.1291562511 |
| Sum | 1851.984318 |
| Variance | 0.8671535682 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.179454741 | 1 | 0.2% |
| 4.703201138 | 1 | 0.2% |
| 5.117779648 | 1 | 0.2% |
| 4.80869487 | 1 | 0.2% |
| 5.350037789 | 1 | 0.2% |
| 4.803955431 | 1 | 0.2% |
| 5.543075503 | 1 | 0.2% |
| 3.023170671 | 1 | 0.2% |
| 6.153919536 | 1 | 0.2% |
| 6.146850232 | 1 | 0.2% |
| Other values (390) | 390 |
| Value | Count | Frequency (%) |
| 1.919279474 | 1 | |
| 1.922645011 | 1 | |
| 2.259631999 | 1 | |
| 2.290004786 | 1 | |
| 2.303020674 | 1 | |
| 2.458922092 | 1 | |
| 2.600238524 | 1 | |
| 2.626259863 | 1 | |
| 2.753884115 | 1 | |
| 2.759090184 | 1 |
| Value | Count | Frequency (%) |
| 7.223465009 | 1 | |
| 7.049364856 | 1 | |
| 6.937079755 | 1 | |
| 6.904950122 | 1 | |
| 6.865075781 | 1 | |
| 6.827404578 | 1 | |
| 6.728095325 | 1 | |
| 6.675426055 | 1 | |
| 6.505632181 | 1 | |
| 6.482682912 | 1 |
| Distinct | 400 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 KiB |
| The bank refused my application for a loan for the second time. | 1 |
|---|---|
| Anyone found wandering the streets after curfew could face sanctions. | 1 |
| He has sensitive skin and soap gives him an awful itch on his hands. | 1 |
| After the relocation, she had to adapt to the new class and school. | 1 |
| Since late childhood, he has had a considerable complex about his looks. | 1 |
| Other values (395) |
Length
| Max length | 94 |
|---|---|
| Median length | 80 |
| Mean length | 67.73 |
| Min length | 45 |
Characters and Unicode
| Total characters | 27092 |
|---|---|
| Distinct characters | 63 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 400 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | The school called because of the student's unauthorised absence for two consecutive days. |
|---|---|
| 2nd row | People from Birmingham have the most recognizable accent from the United Kingdom. |
| 3rd row | Men and women should have equal access to education and employment. |
| 4th row | It is time to turn ideas into action and make the plan happen. |
| 5th row | Anyone over eighteen years of age counts as an adult according to the law. |
Common Values
| Value | Count | Frequency (%) |
| The bank refused my application for a loan for the second time. | 1 | 0.2% |
| Anyone found wandering the streets after curfew could face sanctions. | 1 | 0.2% |
| He has sensitive skin and soap gives him an awful itch on his hands. | 1 | 0.2% |
| After the relocation, she had to adapt to the new class and school. | 1 | 0.2% |
| Since late childhood, he has had a considerable complex about his looks. | 1 | 0.2% |
| They became close friends after a quarrel they had on a bus. | 1 | 0.2% |
| There is a statue of King Henry VIII who is the founder of Trinity College in Cambridge. | 1 | 0.2% |
| On Sunday morning, he reported the theft of his car to the police. | 1 | 0.2% |
| Everybody at the park wore a colourful badge with their name on. | 1 | 0.2% |
| When doing gardening grandma wears an apron to protect her clothes. | 1 | 0.2% |
| Other values (390) | 390 |
Length
| Value | Count | Frequency (%) |
| the | 521 | 10.8% |
| a | 193 | 4.0% |
| of | 154 | 3.2% |
| to | 110 | 2.3% |
| in | 87 | 1.8% |
| and | 65 | 1.3% |
| is | 63 | 1.3% |
| his | 59 | 1.2% |
| for | 59 | 1.2% |
| was | 54 | 1.1% |
| Other values (1938) | 3468 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4433 | ||
| e | 3000 | |
| t | 1891 | 7.0% |
| a | 1740 | 6.4% |
| o | 1609 | 5.9% |
| r | 1416 | 5.2% |
| n | 1408 | 5.2% |
| i | 1379 | 5.1% |
| s | 1367 | 5.0% |
| h | 1360 | 5.0% |
| Other values (53) | 7489 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 21665 | |
| Space Separator | 4433 | 16.4% |
| Other Punctuation | 498 | 1.8% |
| Uppercase Letter | 479 | 1.8% |
| Decimal Number | 9 | < 0.1% |
| Dash Punctuation | 6 | < 0.1% |
| Currency Symbol | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3000 | |
| t | 1891 | 8.7% |
| a | 1740 | 8.0% |
| o | 1609 | 7.4% |
| r | 1416 | 6.5% |
| n | 1408 | 6.5% |
| i | 1379 | 6.4% |
| s | 1367 | 6.3% |
| h | 1360 | 6.3% |
| l | 830 | 3.8% |
| Other values (16) | 5665 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 192 | |
| H | 39 | 8.1% |
| A | 39 | 8.1% |
| S | 32 | 6.7% |
| I | 30 | 6.3% |
| W | 23 | 4.8% |
| C | 16 | 3.3% |
| M | 15 | 3.1% |
| B | 13 | 2.7% |
| D | 9 | 1.9% |
| Other values (14) | 71 | 14.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 2 | 2 | |
| 8 | 2 | |
| 5 | 1 | |
| 3 | 1 | |
| 0 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 398 | |
| , | 71 | 14.3% |
| ' | 29 | 5.8% |
Currency Symbol
| Value | Count | Frequency (%) |
| £ | 1 | |
| $ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 4433 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22144 | |
| Common | 4948 | 18.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3000 | |
| t | 1891 | 8.5% |
| a | 1740 | 7.9% |
| o | 1609 | 7.3% |
| r | 1416 | 6.4% |
| n | 1408 | 6.4% |
| i | 1379 | 6.2% |
| s | 1367 | 6.2% |
| h | 1360 | 6.1% |
| l | 830 | 3.7% |
| Other values (40) | 6144 |
Common
| Value | Count | Frequency (%) |
| 4433 | ||
| . | 398 | 8.0% |
| , | 71 | 1.4% |
| ' | 29 | 0.6% |
| - | 6 | 0.1% |
| 1 | 2 | < 0.1% |
| 2 | 2 | < 0.1% |
| 8 | 2 | < 0.1% |
| £ | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| Other values (3) | 3 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27091 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4433 | ||
| e | 3000 | |
| t | 1891 | 7.0% |
| a | 1740 | 6.4% |
| o | 1609 | 5.9% |
| r | 1416 | 5.2% |
| n | 1408 | 5.2% |
| i | 1379 | 5.1% |
| s | 1367 | 5.0% |
| h | 1360 | 5.0% |
| Other values (52) | 7488 |
None
| Value | Count | Frequency (%) |
| £ | 1 |
A priori Predictability
Categorical
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIFORM| Distinct | 2 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 400 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 200 | |
| 1 | 200 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1 | 200 | |
| 0 | 200 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 200 | |
| 0 | 200 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 400 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 200 | |
| 0 | 200 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 400 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 200 | |
| 0 | 200 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 400 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 200 | |
| 0 | 200 |
| Distinct | 387 |
|---|---|
| Distinct (%) | 96.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 563.309475 |
| Minimum | 485.65 |
|---|---|
| Maximum | 762.82 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 485.65 |
|---|---|
| 5-th percentile | 507.7205 |
| Q1 | 532.7375 |
| median | 555.35 |
| Q3 | 584.1 |
| 95-th percentile | 648.654 |
| Maximum | 762.82 |
| Range | 277.17 |
| Interquartile range (IQR) | 51.3625 |
Descriptive statistics
| Standard deviation | 42.95338759 |
|---|---|
| Coefficient of variation (CV) | 0.07625184645 |
| Kurtosis | 2.697739379 |
| Mean | 563.309475 |
| Median Absolute Deviation (MAD) | 25.42 |
| Skewness | 1.273456667 |
| Sum | 225323.79 |
| Variance | 1844.993506 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 527.18 | 2 | 0.5% |
| 550.42 | 2 | 0.5% |
| 535.18 | 2 | 0.5% |
| 592.08 | 2 | 0.5% |
| 577.21 | 2 | 0.5% |
| 584.16 | 2 | 0.5% |
| 524.66 | 2 | 0.5% |
| 581.03 | 2 | 0.5% |
| 575.85 | 2 | 0.5% |
| 564.03 | 2 | 0.5% |
| Other values (377) | 380 |
| Value | Count | Frequency (%) |
| 485.65 | 1 | |
| 491.93 | 1 | |
| 493.75 | 1 | |
| 495.55 | 1 | |
| 497.9 | 1 | |
| 498.32 | 1 | |
| 499.75 | 1 | |
| 500.36 | 1 | |
| 502.59 | 1 | |
| 503.66 | 1 |
| Value | Count | Frequency (%) |
| 762.82 | 1 | |
| 751.4 | 1 | |
| 746.17 | 1 | |
| 725.62 | 1 | |
| 690.59 | 1 | |
| 688.88 | 1 | |
| 685.33 | 1 | |
| 667.78 | 1 | |
| 667.13 | 1 | |
| 665.14 | 1 |
| Distinct | 20 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.971275 |
| Minimum | 0.39 |
|---|---|
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 0.39 |
|---|---|
| 5-th percentile | 0.89 |
| Q1 | 0.97 |
| median | 0.98 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 0.61 |
| Interquartile range (IQR) | 0.03 |
Descriptive statistics
| Standard deviation | 0.05856790823 |
|---|---|
| Coefficient of variation (CV) | 0.06030002649 |
| Kurtosis | 42.15539432 |
| Mean | 0.971275 |
| Median Absolute Deviation (MAD) | 0.02 |
| Skewness | -5.576393994 |
| Sum | 388.51 |
| Variance | 0.003430199875 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 197 | |
| 0.97 | 56 | 14.0% |
| 0.98 | 55 | 13.8% |
| 0.95 | 47 | 11.8% |
| 0.93 | 10 | 2.5% |
| 0.92 | 8 | 2.0% |
| 0.89 | 7 | 1.8% |
| 0.9 | 6 | 1.5% |
| 0.88 | 2 | 0.5% |
| 0.87 | 2 | 0.5% |
| Other values (10) | 10 | 2.5% |
| Value | Count | Frequency (%) |
| 0.39 | 1 | |
| 0.48 | 1 | |
| 0.63 | 1 | |
| 0.66 | 1 | |
| 0.7 | 1 | |
| 0.71 | 1 | |
| 0.79 | 1 | |
| 0.8 | 1 | |
| 0.84 | 1 | |
| 0.85 | 1 |
| Value | Count | Frequency (%) |
| 1 | 197 | |
| 0.98 | 55 | 13.8% |
| 0.97 | 56 | 14.0% |
| 0.95 | 47 | 11.8% |
| 0.93 | 10 | 2.5% |
| 0.92 | 8 | 2.0% |
| 0.9 | 6 | 1.5% |
| 0.89 | 7 | 1.8% |
| 0.88 | 2 | 0.5% |
| 0.87 | 2 | 0.5% |
| Distinct | 41 |
|---|---|
| Distinct (%) | 10.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.86175 |
| Minimum | 1 |
|---|---|
| Maximum | 3.45 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1.15 |
| Q1 | 1.65 |
| median | 1.85 |
| Q3 | 2 |
| 95-th percentile | 2.6525 |
| Maximum | 3.45 |
| Range | 2.45 |
| Interquartile range (IQR) | 0.35 |
Descriptive statistics
| Standard deviation | 0.4244258247 |
|---|---|
| Coefficient of variation (CV) | 0.227971438 |
| Kurtosis | 0.5518289366 |
| Mean | 1.86175 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | 0.5344537383 |
| Sum | 744.7 |
| Variance | 0.1801372807 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.9 | 41 | 10.2% |
| 1.85 | 33 | 8.2% |
| 1.8 | 29 | 7.2% |
| 1.65 | 27 | 6.8% |
| 1.7 | 26 | 6.5% |
| 1.95 | 23 | 5.8% |
| 1.75 | 21 | 5.2% |
| 1.6 | 13 | 3.2% |
| 1.45 | 13 | 3.2% |
| 1.5 | 12 | 3.0% |
| Other values (31) | 162 |
| Value | Count | Frequency (%) |
| 1 | 10 | |
| 1.05 | 4 | 1.0% |
| 1.1 | 5 | 1.2% |
| 1.15 | 3 | 0.8% |
| 1.2 | 5 | 1.2% |
| 1.25 | 1 | 0.2% |
| 1.3 | 6 | |
| 1.35 | 7 | |
| 1.4 | 6 | |
| 1.45 | 13 |
| Value | Count | Frequency (%) |
| 3.45 | 1 | 0.2% |
| 3.2 | 1 | 0.2% |
| 2.95 | 1 | 0.2% |
| 2.85 | 5 | |
| 2.8 | 7 | |
| 2.75 | 3 | |
| 2.7 | 2 | 0.5% |
| 2.65 | 7 | |
| 2.6 | 5 | |
| 2.55 | 6 |
| Distinct | 4 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 KiB |
| 6 | |
|---|---|
| 5 | |
| 4 | |
| 7 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 400 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 7 |
|---|---|
| 2nd row | 6 |
| 3rd row | 6 |
| 4th row | 6 |
| 5th row | 5 |
Common Values
| Value | Count | Frequency (%) |
| 6 | 126 | |
| 5 | 96 | |
| 4 | 93 | |
| 7 | 85 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 6 | 126 | |
| 5 | 96 | |
| 4 | 93 | |
| 7 | 85 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 126 | |
| 5 | 96 | |
| 4 | 93 | |
| 7 | 85 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 400 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 126 | |
| 5 | 96 | |
| 4 | 93 | |
| 7 | 85 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 400 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 126 | |
| 5 | 96 | |
| 4 | 93 | |
| 7 | 85 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 400 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 126 | |
| 5 | 96 | |
| 4 | 93 | |
| 7 | 85 |
OrthNeighSize
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 20 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.34 |
| Minimum | 0 |
|---|---|
| Maximum | 19 |
| Zeros | 121 |
| Zeros (%) | 30.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 5 |
| 95-th percentile | 12.05 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.136081227 |
|---|---|
| Coefficient of variation (CV) | 1.238347673 |
| Kurtosis | 2.282737497 |
| Mean | 3.34 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.630929123 |
| Sum | 1336 |
| Variance | 17.10716792 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 121 | |
| 1 | 65 | |
| 2 | 47 | 11.8% |
| 3 | 32 | 8.0% |
| 4 | 27 | 6.8% |
| 5 | 22 | 5.5% |
| 6 | 16 | 4.0% |
| 9 | 11 | 2.8% |
| 8 | 10 | 2.5% |
| 7 | 10 | 2.5% |
| Other values (10) | 39 | 9.8% |
| Value | Count | Frequency (%) |
| 0 | 121 | |
| 1 | 65 | |
| 2 | 47 | 11.8% |
| 3 | 32 | 8.0% |
| 4 | 27 | 6.8% |
| 5 | 22 | 5.5% |
| 6 | 16 | 4.0% |
| 7 | 10 | 2.5% |
| 8 | 10 | 2.5% |
| 9 | 11 | 2.8% |
| Value | Count | Frequency (%) |
| 19 | 2 | 0.5% |
| 18 | 2 | 0.5% |
| 17 | 2 | 0.5% |
| 16 | 3 | 0.8% |
| 15 | 4 | |
| 14 | 3 | 0.8% |
| 13 | 4 | |
| 12 | 5 | |
| 11 | 8 | |
| 10 | 6 |
| Distinct | 400 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21460.18253 |
| Minimum | 3221.93 |
|---|---|
| Maximum | 68055.67 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 3221.93 |
|---|---|
| 5-th percentile | 7896.938 |
| Q1 | 14415.2575 |
| median | 19768.08 |
| Q3 | 26636.9575 |
| 95-th percentile | 38808.521 |
| Maximum | 68055.67 |
| Range | 64833.74 |
| Interquartile range (IQR) | 12221.7 |
Descriptive statistics
| Standard deviation | 10782.09973 |
|---|---|
| Coefficient of variation (CV) | 0.5024234866 |
| Kurtosis | 3.378863088 |
| Mean | 21460.18253 |
| Median Absolute Deviation (MAD) | 5993.465 |
| Skewness | 1.450486641 |
| Sum | 8584073.01 |
| Variance | 116253674.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 25013.72 | 1 | 0.2% |
| 29291.25 | 1 | 0.2% |
| 25204.16 | 1 | 0.2% |
| 8269.63 | 1 | 0.2% |
| 21761.69 | 1 | 0.2% |
| 27721.02 | 1 | 0.2% |
| 26052.99 | 1 | 0.2% |
| 60646.85 | 1 | 0.2% |
| 9455.39 | 1 | 0.2% |
| 15114.83 | 1 | 0.2% |
| Other values (390) | 390 |
| Value | Count | Frequency (%) |
| 3221.93 | 1 | |
| 3307.12 | 1 | |
| 4308.27 | 1 | |
| 4785.5 | 1 | |
| 5155.13 | 1 | |
| 5506.73 | 1 | |
| 5880.87 | 1 | |
| 5955.39 | 1 | |
| 6248.58 | 1 | |
| 6518.46 | 1 |
| Value | Count | Frequency (%) |
| 68055.67 | 1 | |
| 67707.22 | 1 | |
| 64482.66 | 1 | |
| 62838.29 | 1 | |
| 62460.46 | 1 | |
| 60646.85 | 1 | |
| 59070.44 | 1 | |
| 58221.62 | 1 | |
| 57883.95 | 1 | |
| 51688.97 | 1 |
| Distinct | 400 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2344.831775 |
| Minimum | 18.34 |
|---|---|
| Maximum | 28695.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 18.34 |
|---|---|
| 5-th percentile | 239.4355 |
| Q1 | 842.98 |
| median | 1751.2 |
| Q3 | 2763.01 |
| 95-th percentile | 6765.454 |
| Maximum | 28695.4 |
| Range | 28677.06 |
| Interquartile range (IQR) | 1920.03 |
Descriptive statistics
| Standard deviation | 2814.404135 |
|---|---|
| Coefficient of variation (CV) | 1.200258443 |
| Kurtosis | 35.39084443 |
| Mean | 2344.831775 |
| Median Absolute Deviation (MAD) | 940.25 |
| Skewness | 4.979486189 |
| Sum | 937932.71 |
| Variance | 7920870.637 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 286.55 | 1 | 0.2% |
| 2177.94 | 1 | 0.2% |
| 1619.93 | 1 | 0.2% |
| 1635.81 | 1 | 0.2% |
| 3218.08 | 1 | 0.2% |
| 75.56 | 1 | 0.2% |
| 2927.42 | 1 | 0.2% |
| 885.22 | 1 | 0.2% |
| 720.37 | 1 | 0.2% |
| 5300.74 | 1 | 0.2% |
| Other values (390) | 390 |
| Value | Count | Frequency (%) |
| 18.34 | 1 | |
| 24.51 | 1 | |
| 40.61 | 1 | |
| 75.56 | 1 | |
| 78.89 | 1 | |
| 84.04 | 1 | |
| 84.36 | 1 | |
| 91.95 | 1 | |
| 112.05 | 1 | |
| 113.81 | 1 |
| Value | Count | Frequency (%) |
| 28695.4 | 1 | |
| 22300.11 | 1 | |
| 21745.33 | 1 | |
| 21720.47 | 1 | |
| 9196.01 | 1 | |
| 9066.99 | 1 | |
| 8920.38 | 1 | |
| 8900.71 | 1 | |
| 8856.11 | 1 | |
| 8738.11 | 1 |
| Distinct | 391 |
|---|---|
| Distinct (%) | 97.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5816.58 |
| Minimum | 81 |
|---|---|
| Maximum | 51028 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 81 |
|---|---|
| 5-th percentile | 320.65 |
| Q1 | 1282.5 |
| median | 3155.5 |
| Q3 | 7662.75 |
| 95-th percentile | 20840.1 |
| Maximum | 51028 |
| Range | 50947 |
| Interquartile range (IQR) | 6380.25 |
Descriptive statistics
| Standard deviation | 7390.535401 |
|---|---|
| Coefficient of variation (CV) | 1.270598084 |
| Kurtosis | 9.381372627 |
| Mean | 5816.58 |
| Median Absolute Deviation (MAD) | 2363.5 |
| Skewness | 2.712438759 |
| Sum | 2326632 |
| Variance | 54620013.51 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1789 | 2 | 0.5% |
| 8871 | 2 | 0.5% |
| 4188 | 2 | 0.5% |
| 735 | 2 | 0.5% |
| 5599 | 2 | 0.5% |
| 277 | 2 | 0.5% |
| 2508 | 2 | 0.5% |
| 3295 | 2 | 0.5% |
| 3838 | 2 | 0.5% |
| 12466 | 1 | 0.2% |
| Other values (381) | 381 |
| Value | Count | Frequency (%) |
| 81 | 1 | |
| 99 | 1 | |
| 101 | 1 | |
| 120 | 1 | |
| 123 | 1 | |
| 153 | 1 | |
| 161 | 1 | |
| 187 | 1 | |
| 189 | 1 | |
| 198 | 1 |
| Value | Count | Frequency (%) |
| 51028 | 1 | |
| 45591 | 1 | |
| 42777 | 1 | |
| 38917 | 1 | |
| 36799 | 1 | |
| 35653 | 1 | |
| 34085 | 1 | |
| 29909 | 1 | |
| 27790 | 1 | |
| 27593 | 1 |
| Distinct | 190 |
|---|---|
| Distinct (%) | 47.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.1606 |
| Minimum | 2.61 |
|---|---|
| Maximum | 5.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 2.61 |
|---|---|
| 5-th percentile | 3.1985 |
| Q1 | 3.8075 |
| median | 4.195 |
| Q3 | 4.58 |
| 95-th percentile | 5.0105 |
| Maximum | 5.4 |
| Range | 2.79 |
| Interquartile range (IQR) | 0.7725 |
Descriptive statistics
| Standard deviation | 0.5507881252 |
|---|---|
| Coefficient of variation (CV) | 0.1323818981 |
| Kurtosis | -0.2898826264 |
| Mean | 4.1606 |
| Median Absolute Deviation (MAD) | 0.385 |
| Skewness | -0.2857956938 |
| Sum | 1664.24 |
| Variance | 0.3033675589 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4.44 | 6 | 1.5% |
| 4.85 | 6 | 1.5% |
| 4.3 | 6 | 1.5% |
| 4.72 | 6 | 1.5% |
| 3.92 | 5 | 1.2% |
| 4.49 | 5 | 1.2% |
| 4.05 | 5 | 1.2% |
| 4.32 | 5 | 1.2% |
| 3.79 | 5 | 1.2% |
| 4.16 | 5 | 1.2% |
| Other values (180) | 346 |
| Value | Count | Frequency (%) |
| 2.61 | 1 | |
| 2.7 | 2 | |
| 2.78 | 1 | |
| 2.79 | 1 | |
| 2.88 | 1 | |
| 2.91 | 1 | |
| 2.97 | 2 | |
| 2.99 | 1 | |
| 3 | 1 | |
| 3.02 | 1 |
| Value | Count | Frequency (%) |
| 5.4 | 1 | |
| 5.35 | 1 | |
| 5.33 | 1 | |
| 5.29 | 1 | |
| 5.26 | 1 | |
| 5.25 | 1 | |
| 5.23 | 1 | |
| 5.17 | 1 | |
| 5.14 | 2 | |
| 5.13 | 1 |
| Distinct | 400 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1520799273 |
| Minimum | 0.007329558954 |
|---|---|
| Maximum | 0.5481328368 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 0.007329558954 |
|---|---|
| 5-th percentile | 0.04445617166 |
| Q1 | 0.1011061096 |
| median | 0.1437613396 |
| Q3 | 0.1906218352 |
| 95-th percentile | 0.2947753156 |
| Maximum | 0.5481328368 |
| Range | 0.5408032779 |
| Interquartile range (IQR) | 0.08951572562 |
Descriptive statistics
| Standard deviation | 0.07673245041 |
|---|---|
| Coefficient of variation (CV) | 0.5045534395 |
| Kurtosis | 1.929525687 |
| Mean | 0.1520799273 |
| Median Absolute Deviation (MAD) | 0.04535981081 |
| Skewness | 0.9822248161 |
| Sum | 60.83197094 |
| Variance | 0.005887868946 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.08679235354 | 1 | 0.2% |
| 0.3606737033 | 1 | 0.2% |
| 0.1575486884 | 1 | 0.2% |
| 0.2895507862 | 1 | 0.2% |
| 0.2150050811 | 1 | 0.2% |
| 0.104243661 | 1 | 0.2% |
| 0.1051754293 | 1 | 0.2% |
| 0.1764343362 | 1 | 0.2% |
| 0.129634502 | 1 | 0.2% |
| 0.2624854166 | 1 | 0.2% |
| Other values (390) | 390 |
| Value | Count | Frequency (%) |
| 0.007329558954 | 1 | |
| 0.008078766366 | 1 | |
| 0.008852293715 | 1 | |
| 0.0131289158 | 1 | |
| 0.01367358677 | 1 | |
| 0.0229658857 | 1 | |
| 0.02362297475 | 1 | |
| 0.02446904033 | 1 | |
| 0.02784502537 | 1 | |
| 0.02808309998 | 1 |
| Value | Count | Frequency (%) |
| 0.5481328368 | 1 | |
| 0.4114860147 | 1 | |
| 0.394775331 | 1 | |
| 0.3880110482 | 1 | |
| 0.3662363589 | 1 | |
| 0.3640329639 | 1 | |
| 0.3606737033 | 1 | |
| 0.3564478844 | 1 | |
| 0.3455776498 | 1 | |
| 0.3375506699 | 1 |
| Distinct | 262 |
|---|---|
| Distinct (%) | 65.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.8720169 |
| Minimum | 2.5 |
|---|---|
| Maximum | 15.56 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 2.5 |
|---|---|
| 5-th percentile | 4.208 |
| Q1 | 6.17 |
| median | 7.76 |
| Q3 | 9.5225 |
| 95-th percentile | 11.9075 |
| Maximum | 15.56 |
| Range | 13.06 |
| Interquartile range (IQR) | 3.3525 |
Descriptive statistics
| Standard deviation | 2.332765258 |
|---|---|
| Coefficient of variation (CV) | 0.2963364139 |
| Kurtosis | -0.2470236004 |
| Mean | 7.8720169 |
| Median Absolute Deviation (MAD) | 1.69 |
| Skewness | 0.2427160435 |
| Sum | 3148.80676 |
| Variance | 5.441793751 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9.11 | 5 | 1.2% |
| 6.05 | 5 | 1.2% |
| 6.58 | 5 | 1.2% |
| 9 | 5 | 1.2% |
| 7 | 4 | 1.0% |
| 6.89 | 4 | 1.0% |
| 8 | 4 | 1.0% |
| 5 | 4 | 1.0% |
| 6 | 4 | 1.0% |
| 5.42 | 3 | 0.8% |
| Other values (252) | 357 |
| Value | Count | Frequency (%) |
| 2.5 | 1 | |
| 2.9 | 1 | |
| 3 | 1 | |
| 3.23 | 1 | |
| 3.4 | 1 | |
| 3.47 | 1 | |
| 3.52 | 1 | |
| 3.56 | 1 | |
| 3.58 | 2 | |
| 3.63 | 2 |
| Value | Count | Frequency (%) |
| 15.56 | 1 | |
| 14.31 | 1 | |
| 14.26 | 1 | |
| 13.59 | 1 | |
| 13.39 | 1 | |
| 13.18 | 1 | |
| 13 | 2 | |
| 12.84 | 1 | |
| 12.53 | 1 | |
| 12.52 | 1 |
| Distinct | 113 |
|---|---|
| Distinct (%) | 28.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2755151414 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 125 |
| Zeros (%) | 31.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.1031914894 |
| Q3 | 0.54 |
| 95-th percentile | 0.9007446809 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.54 |
Descriptive statistics
| Standard deviation | 0.3232484173 |
|---|---|
| Coefficient of variation (CV) | 1.173251008 |
| Kurtosis | -0.7789513316 |
| Mean | 0.2755151414 |
| Median Absolute Deviation (MAD) | 0.1031914894 |
| Skewness | 0.8449501552 |
| Sum | 110.2060565 |
| Variance | 0.1044895393 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 125 | |
| 0.02 | 21 | 5.2% |
| 0.02127659574 | 14 | 3.5% |
| 0.6595744681 | 9 | 2.2% |
| 0.7 | 6 | 1.5% |
| 0.04 | 5 | 1.2% |
| 0.2 | 5 | 1.2% |
| 0.04255319149 | 5 | 1.2% |
| 0.3 | 5 | 1.2% |
| 0.02173913043 | 5 | 1.2% |
| Other values (103) | 200 |
| Value | Count | Frequency (%) |
| 0 | 125 | |
| 0.02 | 21 | 5.2% |
| 0.02127659574 | 14 | 3.5% |
| 0.02173913043 | 5 | 1.2% |
| 0.02222222222 | 1 | 0.2% |
| 0.04 | 5 | 1.2% |
| 0.04255319149 | 5 | 1.2% |
| 0.04347826087 | 1 | 0.2% |
| 0.06 | 4 | 1.0% |
| 0.06382978723 | 4 | 1.0% |
| Value | Count | Frequency (%) |
| 1 | 1 | 0.2% |
| 0.98 | 1 | 0.2% |
| 0.9787234043 | 1 | 0.2% |
| 0.9782608696 | 1 | 0.2% |
| 0.96 | 2 | |
| 0.9574468085 | 3 | |
| 0.94 | 4 | |
| 0.9361702128 | 2 | |
| 0.92 | 3 | |
| 0.914893617 | 2 |
| Distinct | 178 |
|---|---|
| Distinct (%) | 44.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.782090426 |
| Minimum | 2.8 |
|---|---|
| Maximum | 6.765957447 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 2.8 |
|---|---|
| 5-th percentile | 4.358 |
| Q1 | 5.468085106 |
| median | 5.968085106 |
| Q3 | 6.29787234 |
| 95-th percentile | 6.54 |
| Maximum | 6.765957447 |
| Range | 3.965957447 |
| Interquartile range (IQR) | 0.829787234 |
Descriptive statistics
| Standard deviation | 0.7000025904 |
|---|---|
| Coefficient of variation (CV) | 0.12106393 |
| Kurtosis | 2.054958348 |
| Mean | 5.782090426 |
| Median Absolute Deviation (MAD) | 0.3723404255 |
| Skewness | -1.397951111 |
| Sum | 2312.83617 |
| Variance | 0.4900036265 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.36 | 8 | 2.0% |
| 5.85106383 | 8 | 2.0% |
| 6.382978723 | 7 | 1.8% |
| 6.46 | 7 | 1.8% |
| 5.787234043 | 6 | 1.5% |
| 6.08 | 6 | 1.5% |
| 6 | 6 | 1.5% |
| 6.319148936 | 6 | 1.5% |
| 5.553191489 | 6 | 1.5% |
| 6.127659574 | 6 | 1.5% |
| Other values (168) | 334 |
| Value | Count | Frequency (%) |
| 2.8 | 1 | |
| 3.063829787 | 1 | |
| 3.18 | 1 | |
| 3.361702128 | 1 | |
| 3.68 | 1 | |
| 3.7 | 1 | |
| 3.74 | 1 | |
| 3.744680851 | 1 | |
| 3.787234043 | 1 | |
| 3.85106383 | 1 |
| Value | Count | Frequency (%) |
| 6.765957447 | 1 | 0.2% |
| 6.7 | 2 | |
| 6.680851064 | 2 | |
| 6.68 | 1 | 0.2% |
| 6.66 | 1 | 0.2% |
| 6.64 | 3 | |
| 6.638297872 | 2 | |
| 6.62 | 1 | 0.2% |
| 6.6 | 1 | 0.2% |
| 6.595744681 | 1 | 0.2% |
Position
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.455 |
| Minimum | 6 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 7 |
| median | 8 |
| Q3 | 9 |
| 95-th percentile | 11 |
| Maximum | 12 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.548861657 |
|---|---|
| Coefficient of variation (CV) | 0.1831888417 |
| Kurtosis | -0.5505915459 |
| Mean | 8.455 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.3672575643 |
| Sum | 3382 |
| Variance | 2.398972431 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 96 | |
| 7 | 85 | |
| 9 | 83 | |
| 10 | 53 | |
| 6 | 37 | 9.2% |
| 11 | 32 | 8.0% |
| 12 | 14 | 3.5% |
| Value | Count | Frequency (%) |
| 6 | 37 | 9.2% |
| 7 | 85 | |
| 8 | 96 | |
| 9 | 83 | |
| 10 | 53 | |
| 11 | 32 | 8.0% |
| 12 | 14 | 3.5% |
| Value | Count | Frequency (%) |
| 12 | 14 | 3.5% |
| 11 | 32 | 8.0% |
| 10 | 53 | |
| 9 | 83 | |
| 8 | 96 | |
| 7 | 85 | |
| 6 | 37 | 9.2% |
Predictability
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIQUE| Distinct | 400 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4327719026 |
| Minimum | 0.01636023074 |
|---|---|
| Maximum | 0.9999999404 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 0.01636023074 |
|---|---|
| 5-th percentile | 0.1053831946 |
| Q1 | 0.1902351975 |
| median | 0.3598327637 |
| Q3 | 0.6722501367 |
| 95-th percentile | 0.9327819169 |
| Maximum | 0.9999999404 |
| Range | 0.9836397097 |
| Interquartile range (IQR) | 0.4820149392 |
Descriptive statistics
| Standard deviation | 0.2815373316 |
|---|---|
| Coefficient of variation (CV) | 0.6505443858 |
| Kurtosis | -1.056159663 |
| Mean | 0.4327719026 |
| Median Absolute Deviation (MAD) | 0.2048075125 |
| Skewness | 0.5476675896 |
| Sum | 173.108761 |
| Variance | 0.07926326906 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.9999999404 | 1 | 0.2% |
| 0.2477178723 | 1 | 0.2% |
| 0.5606093407 | 1 | 0.2% |
| 0.7522636652 | 1 | 0.2% |
| 0.5618347526 | 1 | 0.2% |
| 0.1375296414 | 1 | 0.2% |
| 0.5042280555 | 1 | 0.2% |
| 0.1356614679 | 1 | 0.2% |
| 0.8759118915 | 1 | 0.2% |
| 0.2061114311 | 1 | 0.2% |
| Other values (390) | 390 |
| Value | Count | Frequency (%) |
| 0.01636023074 | 1 | |
| 0.05263656378 | 1 | |
| 0.06224362552 | 1 | |
| 0.06315302849 | 1 | |
| 0.06712187082 | 1 | |
| 0.07106456906 | 1 | |
| 0.07381238043 | 1 | |
| 0.07686175406 | 1 | |
| 0.08425715566 | 1 | |
| 0.08429520577 | 1 |
| Value | Count | Frequency (%) |
| 0.9999999404 | 1 | |
| 0.9886583686 | 1 | |
| 0.988374114 | 1 | |
| 0.986856699 | 1 | |
| 0.9862526655 | 1 | |
| 0.9850396514 | 1 | |
| 0.9711468816 | 1 | |
| 0.9678087831 | 1 | |
| 0.9662716389 | 1 | |
| 0.9647782445 | 1 |
PRECEDING_Frequency
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 177 |
|---|---|
| Distinct (%) | 44.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2505782.393 |
| Minimum | 11 |
|---|---|
| Maximum | 9418422 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 725.9 |
| Q1 | 11875.25 |
| median | 275160.5 |
| Q3 | 4809384 |
| 95-th percentile | 9418422 |
| Maximum | 9418422 |
| Range | 9418411 |
| Interquartile range (IQR) | 4797508.75 |
Descriptive statistics
| Standard deviation | 3634761.17 |
|---|---|
| Coefficient of variation (CV) | 1.45054941 |
| Kurtosis | -0.4162157297 |
| Mean | 2505782.393 |
| Median Absolute Deviation (MAD) | 271984.5 |
| Skewness | 1.125776366 |
| Sum | 1002312957 |
| Variance | 1.321148876 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9418422 | 74 | 18.5% |
| 4809384 | 41 | 10.2% |
| 502568 | 19 | 4.8% |
| 355000 | 11 | 2.8% |
| 3779783 | 10 | 2.5% |
| 504518 | 9 | 2.2% |
| 2950040 | 5 | 1.2% |
| 384298 | 5 | 1.2% |
| 191855 | 5 | 1.2% |
| 634923 | 4 | 1.0% |
| Other values (167) | 217 |
| Value | Count | Frequency (%) |
| 11 | 1 | |
| 63 | 1 | |
| 74 | 1 | |
| 90 | 1 | |
| 113 | 1 | |
| 161 | 1 | |
| 211 | 1 | |
| 277 | 1 | |
| 301 | 1 | |
| 318 | 1 |
| Value | Count | Frequency (%) |
| 9418422 | 74 | |
| 5327272 | 2 | 0.5% |
| 4809384 | 41 | |
| 3779783 | 10 | 2.5% |
| 2950040 | 5 | 1.2% |
| 1706951 | 2 | 0.5% |
| 1569081 | 1 | 0.2% |
| 1141430 | 3 | 0.8% |
| 850940 | 1 | 0.2% |
| 662093 | 1 | 0.2% |
PRECEDING_LogFreq(Zipf)
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 140 |
|---|---|
| Distinct (%) | 35.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.9237 |
| Minimum | 1.77 |
|---|---|
| Maximum | 7.67 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 1.77 |
|---|---|
| 5-th percentile | 3.559 |
| Q1 | 4.7725 |
| median | 6.135 |
| Q3 | 7.38 |
| 95-th percentile | 7.67 |
| Maximum | 7.67 |
| Range | 5.9 |
| Interquartile range (IQR) | 2.6075 |
Descriptive statistics
| Standard deviation | 1.415330779 |
|---|---|
| Coefficient of variation (CV) | 0.2389268158 |
| Kurtosis | -0.9715281163 |
| Mean | 5.9237 |
| Median Absolute Deviation (MAD) | 1.245 |
| Skewness | -0.3272292804 |
| Sum | 2369.48 |
| Variance | 2.003161213 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7.67 | 74 | 18.5% |
| 7.38 | 41 | 10.2% |
| 6.4 | 28 | 7.0% |
| 6.25 | 11 | 2.8% |
| 7.27 | 10 | 2.5% |
| 5.53 | 6 | 1.5% |
| 6.28 | 6 | 1.5% |
| 5.98 | 5 | 1.2% |
| 7.17 | 5 | 1.2% |
| 5.13 | 5 | 1.2% |
| Other values (130) | 209 |
| Value | Count | Frequency (%) |
| 1.77 | 1 | |
| 2.5 | 1 | |
| 2.57 | 1 | |
| 2.65 | 1 | |
| 2.75 | 1 | |
| 2.91 | 1 | |
| 3.02 | 1 | |
| 3.14 | 1 | |
| 3.18 | 1 | |
| 3.2 | 1 |
| Value | Count | Frequency (%) |
| 7.67 | 74 | |
| 7.42 | 2 | 0.5% |
| 7.38 | 41 | |
| 7.27 | 10 | 2.5% |
| 7.17 | 5 | 1.2% |
| 6.93 | 2 | 0.5% |
| 6.89 | 1 | 0.2% |
| 6.75 | 3 | 0.8% |
| 6.63 | 1 | 0.2% |
| 6.52 | 1 | 0.2% |
| Distinct | 13 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.64 |
| Minimum | 1 |
|---|---|
| Maximum | 13 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 10 |
| Maximum | 13 |
| Range | 12 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.727452565 |
|---|---|
| Coefficient of variation (CV) | 0.5878130528 |
| Kurtosis | 0.02557280029 |
| Mean | 4.64 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.8114518117 |
| Sum | 1856 |
| Variance | 7.438997494 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 118 | |
| 5 | 47 | 11.8% |
| 1 | 41 | 10.2% |
| 6 | 36 | 9.0% |
| 2 | 33 | 8.2% |
| 7 | 30 | 7.5% |
| 4 | 30 | 7.5% |
| 9 | 24 | 6.0% |
| 8 | 18 | 4.5% |
| 10 | 9 | 2.2% |
| Other values (3) | 14 | 3.5% |
| Value | Count | Frequency (%) |
| 1 | 41 | 10.2% |
| 2 | 33 | 8.2% |
| 3 | 118 | |
| 4 | 30 | 7.5% |
| 5 | 47 | 11.8% |
| 6 | 36 | 9.0% |
| 7 | 30 | 7.5% |
| 8 | 18 | 4.5% |
| 9 | 24 | 6.0% |
| 10 | 9 | 2.2% |
| Value | Count | Frequency (%) |
| 13 | 2 | 0.5% |
| 12 | 6 | 1.5% |
| 11 | 6 | 1.5% |
| 10 | 9 | 2.2% |
| 9 | 24 | |
| 8 | 18 | 4.5% |
| 7 | 30 | |
| 6 | 36 | |
| 5 | 47 | |
| 4 | 30 |
SemD
Real number (ℝ≥0)
| Distinct | 397 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 3 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.696729559 |
| Minimum | 0.7431190179 |
|---|---|
| Maximum | 2.288068024 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 0.7431190179 |
|---|---|
| 5-th percentile | 1.27302851 |
| Q1 | 1.541155657 |
| median | 1.701140455 |
| Q3 | 1.86927561 |
| 95-th percentile | 2.110891746 |
| Maximum | 2.288068024 |
| Range | 1.544949006 |
| Interquartile range (IQR) | 0.3281199526 |
Descriptive statistics
| Standard deviation | 0.2500447016 |
|---|---|
| Coefficient of variation (CV) | 0.1473686247 |
| Kurtosis | 0.4324548838 |
| Mean | 1.696729559 |
| Median Absolute Deviation (MAD) | 0.1653816912 |
| Skewness | -0.3404335013 |
| Sum | 673.6016348 |
| Variance | 0.0625223528 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.721522835 | 1 | 0.2% |
| 1.463104923 | 1 | 0.2% |
| 1.341346046 | 1 | 0.2% |
| 1.77631763 | 1 | 0.2% |
| 1.899283384 | 1 | 0.2% |
| 1.627462863 | 1 | 0.2% |
| 1.997061836 | 1 | 0.2% |
| 1.440146521 | 1 | 0.2% |
| 1.520338876 | 1 | 0.2% |
| 1.155537755 | 1 | 0.2% |
| Other values (387) | 387 | |
| (Missing) | 3 | 0.8% |
| Value | Count | Frequency (%) |
| 0.7431190179 | 1 | |
| 0.8086433361 | 1 | |
| 0.919511059 | 1 | |
| 1.025922224 | 1 | |
| 1.108316925 | 1 | |
| 1.11079054 | 1 | |
| 1.136698242 | 1 | |
| 1.147848633 | 1 | |
| 1.155537755 | 1 | |
| 1.168634492 | 1 |
| Value | Count | Frequency (%) |
| 2.288068024 | 1 | |
| 2.231346303 | 1 | |
| 2.205870035 | 1 | |
| 2.183398736 | 1 | |
| 2.177819422 | 1 | |
| 2.171636472 | 1 | |
| 2.166673748 | 1 | |
| 2.164793591 | 1 | |
| 2.161939832 | 1 | |
| 2.161783589 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| ID | Word | Concreteness | Valence | Arousal | SensorimotorStrength | Sentence | A priori Predictability | BLP_rt | BLP_accuracy | OLD20 | #letters | OrthNeighSize | BigramFreq | TrigramFreq | Frequency | LogFreq(Zipf) | similarity | AoA | cloze | Plausibility | Position | Predictability | PRECEDING_Frequency | PRECEDING_LogFreq(Zipf) | LENprec | SemD | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 100 | absence | 2.31 | 3.86 | 4.30 | 3.974638 | The school called because of the student's unauthorised absence for two consecutive days. | 0 | 568.75 | 1.00 | 2.60 | 7 | 0 | 18473.38 | 2537.22 | 1558 | 3.89 | 0.062656 | 7.70 | 0.760000 | 6.460000 | 9 | 0.765174 | 301 | 3.18 | 12 | 2.161784 |
| 1 | 101 | accent | 3.26 | 6.48 | 4.95 | 4.940823 | People from Birmingham have the most recognizable accent from the United Kingdom. | 0 | 547.15 | 1.00 | 1.85 | 6 | 2 | 21569.24 | 4700.67 | 2884 | 4.16 | 0.124462 | 8.60 | 0.940000 | 5.500000 | 8 | 0.945914 | 11 | 1.77 | 12 | 1.662172 |
| 2 | 102 | access | 2.71 | 6.68 | 5.30 | 3.967939 | Men and women should have equal access to education and employment. | 1 | 585.97 | 0.98 | 2.00 | 6 | 0 | 16332.99 | 2611.02 | 11333 | 4.75 | 0.057037 | 9.10 | 0.000000 | 6.540000 | 7 | 0.274104 | 4090 | 4.31 | 5 | 1.919081 |
| 3 | 103 | action | 2.86 | 6.00 | 6.19 | 6.226654 | It is time to turn ideas into action and make the plan happen. | 0 | 518.13 | 1.00 | 1.85 | 6 | 0 | 23973.25 | 8212.90 | 24959 | 5.09 | 0.164773 | 6.67 | 0.440000 | 6.140000 | 8 | 0.394789 | 288299 | 6.16 | 4 | 2.097075 |
| 4 | 104 | adult | 4.40 | 5.90 | 4.36 | 4.841120 | Anyone over eighteen years of age counts as an adult according to the law. | 0 | 497.90 | 1.00 | 1.95 | 5 | 0 | 8625.75 | 647.78 | 5099 | 4.40 | 0.235083 | 4.68 | 0.872340 | 6.446809 | 10 | 0.936184 | 504518 | 6.40 | 2 | 1.703787 |
| 5 | 105 | advice | 2.73 | 5.78 | 3.05 | 4.983896 | That solicitor provides inexpensive legal advice to low-income families. | 0 | 527.51 | 1.00 | 2.25 | 6 | 1 | 11937.24 | 1019.99 | 14193 | 4.85 | 0.188326 | 8.61 | 0.702128 | 5.851064 | 6 | 0.749152 | 11309 | 4.75 | 5 | 1.952718 |
| 6 | 106 | affair | 2.45 | 3.10 | 5.40 | 4.554314 | The government acted carelessly about the whole affair bringing poverty in the country. | 1 | 551.18 | 1.00 | 2.50 | 6 | 0 | 7383.06 | 723.21 | 3986 | 4.30 | 0.090971 | 10.94 | 0.020000 | 5.500000 | 8 | 0.201068 | 70660 | 5.54 | 5 | 1.846129 |
| 7 | 107 | aisle | 4.35 | 5.17 | 2.50 | 4.434396 | The bride walked down the aisle with her dad. | 0 | 610.36 | 0.90 | 1.85 | 5 | 1 | 19086.91 | 334.21 | 1037 | 3.71 | 0.269695 | 5.95 | 0.978261 | 6.680851 | 6 | 0.841931 | 9418422 | 7.67 | 3 | 1.391624 |
| 8 | 108 | alarm | 4.47 | 3.86 | 6.85 | 5.713259 | I accidentally burnt my toast, which triggered the alarm to go off. | 0 | 534.11 | 1.00 | 1.95 | 5 | 0 | 20674.18 | 1154.46 | 3715 | 4.27 | 0.168610 | 6.39 | 0.340426 | 6.382979 | 9 | 0.816084 | 9418422 | 7.67 | 3 | 1.860275 |
| 9 | 109 | album | 4.69 | 6.19 | 5.63 | 5.543076 | The band has just released its new album which contains thirteen songs. | 0 | 589.89 | 0.95 | 2.25 | 5 | 0 | 10611.65 | 18.34 | 7091 | 4.55 | 0.294177 | 6.72 | 0.787234 | 6.340426 | 8 | 0.841453 | 191855 | 5.98 | 3 | 0.808643 |
Last rows
| ID | Word | Concreteness | Valence | Arousal | SensorimotorStrength | Sentence | A priori Predictability | BLP_rt | BLP_accuracy | OLD20 | #letters | OrthNeighSize | BigramFreq | TrigramFreq | Frequency | LogFreq(Zipf) | similarity | AoA | cloze | Plausibility | Position | Predictability | PRECEDING_Frequency | PRECEDING_LogFreq(Zipf) | LENprec | SemD | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 390 | 490 | tune | 3.50 | 7.00 | 3.73 | 4.709302 | He cannot recall the exact tune of the song any more. | 1 | 528.34 | 0.98 | 1.40 | 4 | 8 | 14357.80 | 360.79 | 6167 | 4.49 | 0.091821 | 7.32 | 0.300000 | 5.460000 | 6 | 0.118414 | 3133 | 4.19 | 5 | 1.778941 |
| 391 | 491 | user | 3.16 | 3.67 | 3.21 | 3.265457 | The app was developed to suit the needs of the user when ordering food. | 0 | 550.47 | 0.89 | 1.75 | 4 | 3 | 36494.94 | 2898.87 | 801 | 3.60 | 0.192581 | 9.57 | 0.148936 | 6.021277 | 11 | 0.292560 | 9418422 | 7.67 | 3 | 1.318902 |
| 392 | 492 | value | 1.62 | 7.18 | 5.79 | 3.949182 | After a quick look, the police officer questioned the value of the item. | 1 | 512.45 | 1.00 | 1.65 | 5 | 3 | 10190.28 | 510.96 | 20365 | 5.00 | 0.032166 | 6.78 | 0.000000 | 5.160000 | 10 | 0.063153 | 9418422 | 7.67 | 3 | 2.064811 |
| 393 | 493 | venom | 4.62 | 2.93 | 5.81 | 3.023171 | Cobras are dangerous snakes because of the deadly venom they can inject. | 0 | 621.27 | 0.97 | 2.00 | 5 | 0 | 27322.97 | 1461.85 | 1147 | 3.76 | 0.266112 | 7.95 | 0.760000 | 6.600000 | 9 | 0.851998 | 6087 | 4.48 | 6 | 1.570181 |
| 394 | 494 | verdict | 2.19 | 4.32 | 5.63 | 5.455519 | The jury reached a unanimous verdict and the defendant was found not guilty. | 0 | 578.97 | 0.95 | 2.80 | 7 | 0 | 23329.28 | 2352.71 | 2464 | 4.09 | 0.337551 | 11.05 | 0.240000 | 6.360000 | 6 | 0.605096 | 462 | 3.36 | 9 | 1.623323 |
| 395 | 495 | version | 1.70 | 5.30 | 3.43 | 2.783529 | There are some cool new features in the latest version of the software. | 0 | 584.84 | 1.00 | 1.90 | 7 | 0 | 31201.95 | 6761.00 | 6934 | 4.54 | 0.189077 | 8.11 | 0.021277 | 6.127660 | 10 | 0.248599 | 13356 | 4.82 | 6 | 1.823981 |
| 396 | 496 | virtue | 1.62 | 6.70 | 4.55 | 4.181658 | Patience is undoubtedly Andrew's greatest virtue according to many. | 0 | 583.83 | 1.00 | 2.50 | 6 | 0 | 7711.64 | 338.53 | 658 | 3.51 | 0.180930 | 11.53 | 0.255319 | 5.978723 | 6 | 0.418422 | 12396 | 4.79 | 8 | 1.864383 |
| 397 | 497 | whim | 1.69 | 6.16 | 4.05 | 3.437621 | They travelled to Venice on a whim and they did not have much fun. | 1 | 653.89 | 0.90 | 1.60 | 4 | 7 | 19631.62 | 3956.12 | 287 | 3.16 | 0.085501 | 10.74 | 0.000000 | 4.914894 | 7 | 0.152424 | 4809384 | 7.38 | 1 | 1.945951 |
| 398 | 498 | window | 4.86 | 6.47 | 3.27 | 5.545691 | The student was staring out of the window looking at the clouds. | 0 | 514.62 | 1.00 | 2.20 | 6 | 1 | 32174.71 | 2018.09 | 13791 | 4.84 | 0.127163 | 4.74 | 0.957447 | 6.489362 | 8 | 0.959355 | 9418422 | 7.67 | 3 | 1.628164 |
| 399 | 499 | wisdom | 1.53 | 7.94 | 3.77 | 4.856191 | The young man showed great wisdom in the business despite his age. | 1 | 564.42 | 0.98 | 2.85 | 6 | 0 | 14873.61 | 263.39 | 1684 | 3.92 | 0.123451 | 9.61 | 0.000000 | 5.723404 | 6 | 0.243341 | 199260 | 6.00 | 5 | 1.867304 |